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Description 

1 Background of trie Invention 

1 .1 Field of the Invention 5 

The invention concerns electronic messaging in general 
arxl electronic mai in particular. 

1 2 Description of the Prior Art 

A major armoyarKO in the conventional mai systm is 
junk maa. As electronic mail has grown in availabiity and 
popularity, funk electronic mail has become a pr otim as 
we*. Indeed, ttw ease wHh which an e-mail message 
may t>e sent to many reqpients may eventuafy make 
jink e-mail an even worse problem that juik convm- 
tkKxalmail. 

The pr k)r art has attempted to deal with 0^ junk e- 
mail problem by means of mail filers in an e-n^ redp- 
ient's kx:al e-mail system Such a Wer sorts inccmhg e- 
mail lor the recipient into categories determined by the 
redpeni The laier siirply scans each e-raB message 
as it reaches the recipient and determines what category 
it should be placed in. One category is of course *dfe- 
cardT. Messages whch the filer places in t^at category 
are automaticany dscarded. Prior-art fitters have had 
varying degrees of ^elfigence; some have snvfy 
worked with lists of source addresses and have sorted 
accorcfing to the source of the message: others have 
used keywords provided by tt>e redpiem to sort;witholh* 
ers, f inaBy. the liter obsenres how the reqpier4 sorts lis 
emai tor awhie and is then able to sort 11 a sirnbr teh- 
ipa For detais about mail titers, see Peter W. Fdte and 
Susan T. Dumais. "Personafized information defivery: an 
analysis of information f itenng methods*. Comnvnica- 
tions oi the ACM. vol. 35. na 12. Dec. 199?. ppi 5I«; 
p.K. Gilford. aWL Baklwin. S.T BeifirK Lixasserv 
*An arcNteciure for large scale information systems*, in 
PfOceetSngs Tenih Symposium on Operating System 
Phndptes, (Ocas Island. Wash.. Dec 1985), fpi 161- 
170: E, lajtz. KV. Kleist Relzow. and K. Hbetrahg. 
*MAFIA • An active maimer ager4 for an tnteTiger^ doc* 
wnent processing supporT. in Multh User bi^ffaoes and 
Appfy:ations. & Gibbs andh A.;^L Vemjn-Shxart Eds. 
^k)nhHoBand, 1990.pp. 16-32; TW.Malone.lCaOanl 
FA. Twbak. BroMfSt M.D. Cohen. Tntdligeit intor- 
matfon sharing systems*. Common. ACM 30. 5 (May 
1987) 390^102; S. PbflacK 'A niel)ased message fifter- 
ing system*. ACM Trans. Off, fnf. Syst 6. 3 (JWy 1988). 
232-254. R Maes. * Agents that RedOce Wbrk arvf fofor- 
matfonOverfoad*. Conwmn. ACM 37 (7) {My 1994), ppi 
31-40. A problem with al such titers is that sorting for 
another person is drffcult even for a human bekig. and 
if a tater is going to be usefii, » cannot do much worse 
ttian a iHiman woUd. 

One ol the reasons for tfie jurk mail is that present- 
day e^fl systems reqiire t«t recipients be addressed 
by e-mai addresses, fo order to ensure that an e-mal 



message wfl reach everyone who n^j^ possibly be 
interested in it the sender typicaty uses a ist of 
addr esses wfiich includes those who might be inter ested 
but includes many others as wett. For everyomfgt those 
actually interested, ttte e-mai is of coiirseiLtfik rnail. 

What is needed to reduce the anrxRrt of juiii mail is 
a techrvc^ which pernrvts a sender to use sornelNng 
adcfition to the e-mail addr ess to specify the kinds of peo- 
ple wtx) are to actuafiy receive the e-n^ and per nvts a 

titer to the information provided by theservJertofeter 
the mai so tfiat only tfwse Ignds of people actuary 
receive it 9 is an obtect of the inventkyi (fecfosed heretfi 
to provide such a technique and thereby to reduce the 
.amouitofM^e-mai received by a user of the e-mail 
systeniL 

2 Summary of the Invention 

The invention reduces the amount of junk e-mail 
r eceived by a user of the e^nai systemby adcSng a recip- 
ient specifier to an e-msd message. The redpient spec- 
ifier non-address information to fi^ther specify the 
recipients in the group to wfiom the message is sent who 
shoiAl acfiualy r eceive ff)e messaga The mai filer for 
a given rec%)ient has access to information alxxit that 
reqpient and uses tfiat information togetf>er with ttie non- 
address information in the e-mai message to determra 
whether the message shoUd be protnded to the given 
reqpienl I 9>e non-ad*ess information and the ^for- 
mation about the redpient indcate that »)e rec^- 

er< should not receive the message; me fifter does not 
provide it 

In another aspect d 9^ inventioa tie sender's mail 
titer does 9)e fltering. The sendenr provides a recipient 
^pecTier which uses non^addkess information to spedty 
potential reqpients to the mai titer, to ««s aspect, how* 
ever, the sender's mai fiter has access to information 
about the possi3te recipients and uses tfiis infornration 
togeltier wit) the non-add^ess information to determra 

the poteritiaf rec<)ienls to whom the inessage shoiAf be 
sent 

The first and second aspects of tfie invention are 
combined in a forlher asped of tfie iriverition. nanriely a 
system for locating expert in tfiee^system. tothis 
systenv tfie sender spec9ies an area of e39>ertise by 
means of a ist of keywords whkh are relevant to tfie 
area. The fist of keywords is mduded in a redpier4 spec- 
iier in the message. The rroi fiter for a potential rec^ 
er4 has access to the document ties of the potential 
reqpient and to a list of the e-mai messages sent and 
received by the potential recipient The mai fiter uses 
the document ties to determine ttie reopienTs areas of 
expertise. » ttie keywords in the rec%)ient spedRer rrafoh 
one of tile areas of expertise; the mai nter provides the 
emai message to the potential reqpient: if not. the mat 
fiter uses the fist of e^mai messages to d^emine corr. 
respondents of the the potential redpient who may have 
the ar ea of expertise ^pecTied in the rec%3ient specTier 

and fonvards the rnessage to tfK>seoorf espondents The 
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maH later cl eac^ pciential feci(>(^ 

vides the message tq fte redpieni further sericte a 

ral message to the sender of the message, who thus 

knoAfS exactfy who r^cerved the message. 

Other obiects and ad/antages of the apparatus and 5 
methods dtsdosed herein wl be apparent to those of 
ordkiary slol in tfie art i4)on perusal of the Ibflowvtg 
Drawing and Detailed Descnplion. wherein: 

3 Brief Descflpiion ol the Drawing 

FIG. 1 is a Ngh4e^ block diagram of apparatus 
enixxJying the irvertion; 

FIG. ? is a (Sagram of user model 1 13 in a preferred 
embocfimeni: 

FK3. 3isa<fiagramof coaesporx5entmodels 111 in 
a preferred errtxxinnent: and FIQ. 4 is a diagram of 
data stactues used by maH later 109 in a preferred 
entxxfimer<. 

Refer ence numbers m the Drawing have two parts: 
the two least-signrrcarl (figits af e the r^jnriber of an itm 
in a figure; Ote remaining dgAs are the number of tie 
Figue in which Oie (tern first appears. Thus, an item witfi 
the reference nunte 201 first appears in FIG. 2. 

4 Detailed Des$np6on of a Preferred Embodimefil 

The following Detaied Descrpfion be^ with an otf^ 
view of the invent and then describes in detaa how the 
irivention is inplernerited in apparatus to locate expertise 
in an e-mail systera 

Overview of ttie invention: FIG. I 

FKx 1 shMs a hi^h-^evel overview of apparatus 101 
which ennt>odte fie inwenGoa Apparatus 101 is 
errployed m a netMofc 103 wNch connects a nurter of 
users lOS{a..n). Netwok 103 may be a network such as 
fntemet or a commerdal e-mai nelworK or it may l>e an 
e-maa system which cornmuttcatesl>etween users of a 
single convsuler systeoi Each user 105 is conneded to 
network 103 by means of a Gnk 107 over which user 105 
can send and receive e-mai messages. A mail item of 
the type used in the invertion is shown at 1 1 9; nrcui it em 
119 is a standard e^nai message except for two addi- 
tional conponenls: 

1. rec^ent specfier 121 whkh uses non-address 
information to fulher describe the redjpienls wfK> 
slKxAl receive ttie e^msMl; and 

2. referral fist 127. which is a Sst of poterHial rec^)l- 
enis who passed 9)e e-mai on and of recfxents Id 
wtiom the emi was provided 

Reqptem spedHer 1 21 has twoparts. reqpienl t>pe field 
123. whkh general y indk:ates how recpienC specTier 
121 is to be irtferpreledL and reqpienl descrption 125. 



wT^ich contains the non-address information wf^ is 
actually used lodetermine whether mal item 119 is to 
be provided lo a given reqpienl ^ 

A user 105 who wishes to redtx:e iNlfinourit of )ur^ 
e-maa he receives has a mail fiher 109 as part of Ns e- 
mal systera When an e-mai item 119 is sent to user 
105*5 address, mai filter 109 interprets recipient spect'- 
f»r 121 todetermr>ewf)ether mai item 1 19 is to be pro- 
vided to user 105(n). In interpreting recipient specilier 
109. mail liter 109 enploys user model 113. which is 
data that provides a model of user lOS(n). If rec^)ient 
descrption 125 speoftes a recipient which is of the same 
lond as that specified by user model 113. man later 109 
adds mafl item 1 19 to fitered mai 1 15 and informs user 
105(n) via interactive user mail interlace 117 that mail 
hasanived. If user I05(n) desires, mai liter I09canlur- 
Iher use the information in refena) Est 127 to irxficate the 
chain of referrals which resulted in the message being 
dkected to user 105(n). In some errixximenls. mai filer 
109 may also use the information in referral list 127 to 
send a receipt 129 which ideoliries the e^mai message, 
the chain of referrals, and user I05(n) to the original 
sender of mai item 119. 

If user model 1 13 does not specify a reqptent which 
is cf tfie same Wnd specified by r eqpient descrption 1 25. 
mai filter 109 looks lo conespondeni models 111 to 
determine where to send mail iem lia There is a cor- 
respondent model 111(m) for each of tser 105(n) s fre- 
quent conespondents. and Bee user model 113. each 
correspondent model 111(m) contains data wfvch mall 
filer lOOcanusetogetherwithredipienldescnption 125 
to detennne which of user 105(n)'s coaespcndents 
should receive mai item 119. Mail titer 109 tf>en adds 
the names and e-mai addresses of those conespond- 
er4s to ref enal fist 127 in mai item 1 1 9 and fonvards mail 
item 119 to those correspondents, i they ki turn have 
mai liters 109. they wfl also liter mai item 119 as just 
described. In a preferred entxxfiment user 105(n) may 
specify how much control he desires over forwardvig. 
Fbnwardfc>g may be conplelely aulomaSc. or mail filer 
109 may preserrt user 105(n) with the Wonralion from 
recvtentdesa^>tion125andafist of the correspondents 
it has found for lonMard&tg and let user 105(n) select 

which of the oonespondenis is 16 receive the forwarded 
letter. 

II user I05(n) wishes to send an e-mai message 
with a reqpienl specifier 121. user 105(n) makes that 
request of mail filer 109. Mai liter 109 uses mierface 
1 17 toobtain infonnation from user I05{n) which if uses 
to make redpient speciier 121. MaH Bier 109 then uses 
recpient specifier 121 wi^ correspondent models 111 
in the manner desor«>ed abo^e to make a Fist d the cor- 
respondents who shoiid receive tfie message: Depend- 
ing on the nplemenlatibrx. maif filer 109 may sinply 
send the e-mai message to those correspondents or 
permituserl05(n)to5electcorrespondenl5fromthefi$l . 
Ihe selected correspondents wil of Gouse t>e placed on 
referral list 127. 
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In FIG 1. mai filter 109 and correspondent modets 
111 and user model tl3^eaitmplefnemedHi9>elocal 
computer system used by user lOS(n). Such an krpl^ 
mentation is actvantageous in ttaX the information in cor- 
respondert models 111 'and user model 113 remains 5 
under the oonM of user i105(n). In other erhbodMnerts. 
however, mai fiter 109 rnay l>e located at any pOtf4 in 
networl( 103. Meed, some ennbodtments may ooniain 
only conespondent models 111. For example, a <tota 
l>aseofciJGlQrv)er information cnigN be used as a corre- 10 
sporxlenl model 1 1 1 . and mai fiter 1 09 might use recp* 
ient descrpfion 125 together with the data base of 
customer irtormaticn to determine which customers 
should receive e-mai abou a new product or service. 

15 

A System tor Locating Expertise 

The techniques d€scrt>ed abo^ are employed in a pre- 
ferred ennbodiment to make a system for locating B^ta- 
tise. The fcA>tving discussion fast explains the uff^ so 
such a system and then presents two different entxxfi- 
ments. 

Using a Con«Niter to Find Mormatlon 

25 

There are baskaBy two ways of firvfing something out by 
using a computer: *a5k a programT and 'ask a person*. 

The firsl mfem all ways of accessing iriormation 
stored orfine. incktfng the ise ol traditional database 
programs: fle indexing and retrieval programs such as » 
gfimpse (by Udi Manber atlMversity of Arizona) or 
Apple's Apple-Search: news fiftering programs such as 
Hoow (SandPbint Corpi); md even more simply the 

use cl tools such as f^ awK. and text ecftors to retrieve 
arxJ view fSes. as 

Tt)e seoondl *ask a person*, covers ways ^ a com- 
puter can be used as a comrnunicafion mednjmt>etween 
people Curently the prime examples are dcctoric 
niail. inctodvigtwmpersoriale-rnaiandrii^^ 
buHetintxiards ar«l newsgroi43& The growing irtegrafion 40 
of conputers and telephortes aBows us to also view 
telephony as a computer^l>ased 
niecSunv Single exarnples ol such integral are teie^ 
pfione addk^ess book programs that run on a persorol or 
pocket corrputer and (fidlnunt)ers for you: rnoresophis- 45 
ticated fS tie explosion in the use of corrputer-basad 
FAX. Tcxlay it is hard to even buy amodem that does not 
have FAX cstpabaity. and by tar the heaviest use of FAX 
is for petsorv%>i>erson corninunicatioa 

There are inherent proUems witti both general so 
approaches to obtaining informatea It has often been 
noted that as tf>e world ol onlir>e l ifoiii u gun sources 
expands, t« *asli a program!* approach suners from the 
problem d knowing where to took. For eiairpfe the 
Mosaic system overcorriesrTiariy of the technicsiprot)- 55 
lems in accessing a wide variety of witormatSon onthe 
Merriet t>y automaficaOy handbig the tow^ievd details 
d cfifferenc comnxjnicafion protocols. It is easy and 
entertaining to bfumse through an erxxmous hyperme- 



dia space. HoM^ever. firxSng an ar^swer to a specif ic ques- 
tion using Mosaic tends to be slow and frustrating, and 
often results in tadure. One response to this problem has 
t>een the atterrpt to design systems that^i^orporate 
lax>vvtedge about the kxation ol inlorrnation. sud) as the 
Information Maoffokt project (by T KirK A. and D. 
Srivastava« of AT&T Bei l.at>s). However, a deeper prob- 
lem remains, thait no solU6ont>ased solely onbuBcfirxj a 
better search-engine can address. This is the fact that 
much valuable inlbrmalion is simply not online, but only 
exists in peopled heads. Furthermore, (here are eco* 
nomic social, and poMcal reasons that much valuable 
iniornnation wil never t>e made pdbMf accessible on the 
Internet or any other network. Indeed, part of the vakiO 
of a piece of iriormation resides in the degree to wtvch 
it is not easily accessUa 

In any large organizafion. determining wfv> is an 
expert onaparfojiar topic isacrudal problem Ihe need 
for expertise tocation ranges from informal siluatkxis. 
such as wtiere I might need to Hnd an expert on laTex 
macros to he^ fix a typesetting problem in a paper Vm 
writing, to formal construction of project teams to meet 
business needs. The range of esqpertise spedficatkyis 
may range from the generic fwhokrxTws about fogic pro- 
gramming r) to the Hg^ speofic (Nvho knows how to 
modify the interrupt vector tiandSng microcode in the 
reboot module of the XZY999 processor?7. 

Onfine cSrectori es of e39)erfee rarely Gdst and when 
they da the information tiat contain is certan to be out 
of date and inoompleie. In fact expertise needs are 
potentially sospedfictfiatiissimplyinrposstjletodeter- 
nine a oomprehensrve set of categories in advance. 
Expertise focatfon is therefore generally an *ask a per* 
son' task, with the al the problems associated with that 
approach outir^ed atxjva 

t-et us corisider for a rnornent how expert focatior% 
actually works wtien t is successful. In a typical case I 
contact a smal set of Goleagues whom i think might be 
tamifiarwi9i the tope. Because each person knows me 
persortaBy. they are qude ikely to respond. Usually ittrie 
of them is exactly ttie person I want: however, they can 
refer me to sorneone <hey IsKw who rni;^ be After tbf- 
k)wing a chain cf referrals a lew layers deep I fraBy fnf 
the person I wart. 

Notethat in this successful scenariolneededtowaSc 
a fine line t>etween a)ritacSng loo few people (and ttvs 
nd finding the true expert) and oonbKrSng too many (and 
eventually making a pest of myselQ. Even in the end t 
might worxier if I nvg^ not have fourxl even a t>et!er 
expert if only I couk) have cast the net a bit wider. I may 
have had difficulty bringpng to mind tiose people I do 
know personaly who Ivave some expert in the deseed 
area If only al of my coleagues enrpfoyed endlessly 
patient assistants that I couki have contacted kii6aSy. 
who would Itave Imwn something atxM tieir bosses* 
areas of expertise^ and wfiocouki have mwered my 
fial queries wfttxxit d^stuting everyone... 

1^ let us conskler how maB filters coUd be used 
to augment the expert tocak)n process. Each person's 
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mailfiltef wouk) aeate a model ol that person's areas of 
interest This moM would be aeated autooiaticaBy by 
using inforinafk)n retrieval (IR) techniques (such as 
in/erted indexes) on ai the documents created arxJ 
received by the user. The user model could bequite large 5 
and detailed, and would be private to the user, that is. 
not stored in a central datatsase. The mail titer woiid 
also create a mich more coarse-grained model of my 
contacts by applying simitar techr^es to all the elec- 
Ironic mail Ittat I exdwige with each person. 10 

When I tiave an expertise location need, I present 
the problem to my mail f9ter as an unstructured teoct 
desaiption. Again using IR techniques, my mai fdter 
selects a medium-to^arge set of my contacts to whom 
the query may be relevant It then broadcasts the query, is 
not to the people themselves, but to their mail filters. 
Upon receipt of the questioa each mail filter checks if its 
owner's user model does indeed prcvide a good match. 
If there is a good match, the mail filter presents my 
request to its owr^er. H the owner's rrxxlel does not match. 20 
txit the model of one of the owner's contacts does, tf^ 
the mail filter can ask the owner if it can provide a referral 
Ftr>alty. if there is no match at aB. the query is silentty 
togged and deleted. A great deal of fflexixifty can t)e buRt 
irto each mail titer, depending upon its owner's prefer- 25 
erKes. For ex^ple. I might allow automatic referrals to 
t>e $Hven to requests that come from my ctosest col- 
leagues. ^ 

This system provides several t>enefits over either 
sencfing personal e-mai to everyorie in order to find an 30 
expert or using rtetnews to Hnd t\e expert. First it is 
largely passive on tfie part of the recipiervts * 9>ey do not 
need to be readkig netnews and wacfrig through dozens 
of articles. Second, queries are txoadcast in a focused 
manner to those who are at least somewtiatikeiy to fvKf as 
them of interest Thffd. users are shidded from seeing a 
large numt^er of completely irrelevant messages; each 
mailfiiter 109 may processdozensof messages for every 
one it)e user seesw FraBy. messages that a user does 
see do not come from "out of the blue', txil rather are 40 
tagged with a chain of referrals from colleague to col- 
league. 

Or)ereasontot>efieve that thesyst em just descr bed 
wouki t>e usefil in practice is that il t)asically models the 
manner in which expertise kxration actuaOy worl^ noMr «5 
(O. Krackhardt and JLR. Hansoa 'Vrtormal Networks: 
The Company BeNnd the CharT. Harvard Business 
Review. July- August 1993)^ wf^ aflowing nKye people 
to t>e contacted without causir>g dlsrif>tion and dtetur* 
trance. so 

Implementation of an Expertise Locator 

A preser^-prefened errtxxSment of the expertise toca- 
tor has been implemented using 0ie network agerte 55 
desai>ed in Coea et aL, Netwofk Agents. US. f^tenl 
appKcationi 08/203.147. tied 2/28/94. In the inpfemerv 
tation. mail flteir 109 is a component of a user agent 
which handles e-mal messages kx its user. Mai filers 



109 are written in the programming language Visual 
Basic, and run on a standard personal computer. Inter- 
actrve user mail interface 117 preserg^ Vie expertise 
locator in mail f9ter 109 to the user as^g anthropomor* 
pNc "laBdng head* that appears in a wihdow on the corn- 
pUer saeen. AB the compuers rimng nnail titers 109 
are r^etworked (cmeritly using the protocol TCP/IP), arid 
can exchange electronic mail with each otfier and with 
any person. A mai titer 109 can also invoke other pro- 
grams to perform various subtasks. 

Each mai titer 109riasacces$ tohwosetsof data 
base ties. The Frst set shown in FIGL 2. implements cor- 
respondent models 111; the second set. shown in FIG. 
3. iTDplements user model 113. Each of the data l>8se 
fies in the two sets is spec^ to and owned by tf)e indi- 
vidual user d mai titer 109. It is important to note that 
we do not assunrte that these ties can be <firectfy 
accessed by anyor>e other than the user and mail filer 
109. 

Conespondent nrvxlels 111 contains the foBowing 
fwe fies: ^ 

• CcOeague Bst 201 which contains entries 203 for 
some d the user's cdeagues. Each entry 203 corv 
tains an identification 205 for the ooB eague and each 
a Kst of keywords 207 descrfoing tfie colleague's 

areas of expertise. 

• An Emai fie 209 wfach contains al of the emai 
2l1(0..n) tf^ the user has sent and received for a 
substantial period of time: typicaBy. the past year or 
several years. 

• An Emai inverted index fie 213 tfial has an entry 
215 lor each word that appears in any emai mes- 
sage Entry 2l5containsaword217anda6st of the 
rmjmbeis of tlie messages in emai fie 209 that corv 
tain tat word. This Idhd of fie can be generated 
using starxSard ii toi i uti on retrieval algoriltwns. 
such as tx)se descrt>ed In (O, Salton. Automask; 
TextPfocesskig, AdcSson-Vfesley 1989). 

• AsenderAeqpienllistfie2211hatha5anenlry223 
tor each message in emai fie 209. The enlry (foiv 
tains ttie identifier of tt)e sender of the cor responding 
message (if other than tfie user) or tfie identifier of 
the recipient ol the conesponding message (if sent 
bf t>e user). 

FXx 3 shows the data t>ase fies used to implement user 
model 113. 

• Itser expertise Sst 301 is a fie contaMng a Kst of 
Keyworfis ttiat descrt)e some of }he userls own 
areas of expertise 

• User fies inverted index 305 is a fie contairwig an 
invertedindexof text ties in tie user'scSrectory That 
is; for every wad 9iat appears in any fie ttte user 
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has stored on the connputef. this tie contains a list 
of the names of the ties contairwig that woid 

In the preferred entodimept. coleague list 201 and user 
expertise Sst 301 are aeated by rnai liter 109 n inter- s 
acfion with user 105{n); the inverted index lies 213 and 
user ties inverted index 305 are aeated automaticaay 
by mail filer 109. This Idnd of very large inverted Mex 
can be quicWy created and searched by the pro-am 
•«^inpse- (U. Manber and S. Wi. X5UMPSE: A Tool to to 
Search Through Entire Fie Systerm.* Usenet Winter 
1994 Technical Conference. San Frandsoo (January 
1994). pp. 23-32). In making inverted ist 305. 
uses a UNO( operating system (UNIX is a trademarlt of 
XOPE N) ut£ty which deftermines whether a fie is a text 15 
fife. In aM6cn. tfie 1^ can specify to GUMPSE wtwch 
directories ol ties or incfividual ties stfe to be indexed. 

A user begins the process of locating m expert n a 
topic by ciicidng on the window for mai filer 109andtyp- 
ing a phrase «iatdescrt>es the general Wnd of request 20 

^ ^ ^ expert-). Mail filer 1 09 Ihen 
prompts the user for a phrase desatmQ the area of 
expertise. Once this is done, mai filer 109 generates 

and preserts for approval a fist of suggested candUates 
for receiving ffte request 55 

The list of candidates is generated by combinvig 

names from two sources. F«st names ste added lhat 

appear in colleague fist 201. such that the words that 

appear tfi ffie phrase descnbing the expertise request 

appear in the list of keywords 207 associated wHh name 30 
205. 

^^^cord^ names are added »»t result from he fbl- 
^»»ingeompuatioa Ttrst for each word that appears in 
the expertise request, mail filter 109 refrieves from err«i 
invefted«xfexfie213aistormessages403(0.j))(Fia as 
4) containing 9ia| word. Next tiefritersecticn of tfie ists 
is confuted, generating a ist of messages 405 each of 
which appem in every one of the premus fet& riext 
fist of messages.405 is a)fnpared against senderi^ec^ 

iertist He 221. and the total nunri)er of messages tt«t 4a 
appearm ist of messages 405 that are from eacA each 
petsori in senderAeppient ist 221 is c^culated. The 

resut is a nameAnessage nuntw pair fet 407 of pafrs 
of Iperson name' and *nuni>er of messages*. FraHy. ist 
407 is sorted accor<fing to -nuTtw of messages-. The 4$ 
20 names wWi the highest nunnber of messages in this 
list are tfien added to me fist of cafKftlate& 

After the fislofcanddates has been approvedby the 
user. rnaB titer 109 makes a reqpient specTier 121 and 

adds it to ttte email message Redpent specifier 121 50 
coritains a redptent type request 121 which spedfies that 
an expert is being requested and expertise desaf>tion 
401 is used as reqpient descrption 12Si 

The message fravefe through «»e network and 
arrives at the compUer system6(4 of the fec^)ieniSL 5$ 
Eachreqpientmai filer I09notesreqpientspeciner 121 
specifying that an expert is being requestedl remoires 

the e-mail message froni ihe inoonwig rnaa stream, and 
processes i as foQows: 



Rrst f»e ^Kxds in eiqMr^ description 401 corv 

to>ed in Jhe messages recpert specifier 121 ve 

malct>ed against the reapienTs user expertise ist 301. 

n tie words appear in fist 301. then ma^er 109 

assumes tfiat Ms reqMOSt is appropriate lor ttie reop«4 
tosee 

tf the words in the phrase do not match agakist tie 
conerts ol iser expertise Est 301. mai titer 109 uses 
user ties inverted index fie 305 to match tie pfvase 
against t>e cortents of al of tie reqpienrs ties wtidi 
are indexed in fie 30S. This rnalciiing can be effoenty 
perlomned using the program -GUMPSE' menticned 
above I tie ruTter of matdies is greater then a tvesh- 
old nuTt>er {eg, nKsre tian 10 ntatches), tie recipiercs 

mai fiter 109 deterrnines tiat ti^ request is Ekety to tie 
appropnate lor the rec^ent 

1 1^ recf)ierit^ rnai filer tius deterinnes n edher 
way tiat tie message is appropriatei it uses user rroi 
irterlace 117 to make t)e tie message ^^spear on tie 
reqpiefirs con^^er saeea The reqpient is tien gvm 
t>e option of 0 responcfing alirmatively back to tie 
send^^responcfing negatwiely bad< to ttiesendw; cr 
(ii) referring tfie reqpest to someone eise V tvs fir^ 
option is selected, the reqpier«^ mai titer 109 aeales 

a ist of candidate reqpiertsasdesGnbed itove and t« 
process is repeated. 

As is apparent from tie foregoing descrptioa tie 
prelwedennbodfcnefit of tie expertise tocatorkKieases 
its efficiency by ifiing twostage oonesporxtent modeib 
1 11 and user models 1 13. The first stage is tie expicd 
descriptions ol expertise contained in coleague fet 201 
and user expertise ist 301; tie second stage is tie 

inverted indexes: inverted index 213 »ito erny fie 209 
and inverted index 305 into tie tie user^s text ties. The 
algoritvns first use tlie expertise fists 201 afld30i»ml 
tien tiey may in addttion use tie inverted nlexes! 

Example II: EMianced Yeiow Page Service 

The general techniques descHbediribove can be ^]pied 
to marv (iffereni Mnds of tasls. The general ^sfmOt 

is useM wfien tie foRowing corxStions hokl: 

1. >bu wish to contact a large number of people, 
witiout necessariy broadcasting messages to eve- 
ryone in tie world. In tie expertise tocationexarr^ 
tie iser agert fie^ped deterrrine a prefifiwiary ist cf 
cancfidates based on a matching schema Other 

ways of deCemmng whom to send tie message to 
are also usefuf. h the exarrple below, tie recperts 
are simply taken to be a fixed fist of tie sender's 
frierxis arxl coQeagues. 

2. Mmi wart trie message ycu send to ortfy be sem 
by peoplelo whom is it very Beefy torefewt. ki order 
to avoid being disnjv>tiva To mat end. you want tie 
message you send to eifsfidty indicate tie corxfi*- - 
tions under which which i should be tatoi to be rd- 
evani Note tiat tie corvvxifiation of relevancy 

rc» on tnfo ii natkjrt that is private to tie recyierCh 
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the previois muiple. ihe render iTKficate^ 
eral condHions of relevancy by recpient type lieW 
123 (thus indicalinQ the general kind of processing 
to be performed $y the recipient's mai titer 109) and 
the Mrds in recf)ieritdesaip6on field 125 desert)- 5 
ir>g the Kind of expertise requred (thus pro^ng the 
paraineters to that processirtg). Another way ol say* 
ing this is that the sender pro-activeiy determines the 
general manner in wtiich the message is lo be em 
fitered. Note that this isdHferertt from earfier work w 
on mait f dtenr>g. wtwi) always assumes that the 
fecipientoH a message is oorppletely responstile for 
estabfisNng the conditions for f Utering Cit any), and 
the sender is completdy 'passive' with regard lo f i* 
tering. ,5 

We iRustrate these core points with the fbnowing 
'EnharKod Yellow Page* serviceL The basic idea is to 
provide a service that assists people in cbtainir^g one or 
mete personal recommendations atxM a professional 20 
service or business. The system would wortc as foBows. 

A customer contacts the Enharxed YeUow Page 
Service (EVPS) asking for a nurber of a particuiar sen^- 
ice (e.g.. a flower defKrery service; an autobody shop, a 
roofer, etc.). The contact with the EVPS couki t>e made 2s 
by many possible means of oommunicatiorv irKkxiing 
telepfxm an oir^fine service^ an irtferr^ Mosaic/HTTP 
server, or elecflfbr^ mail: altematrvefy. the EVPS soft- 
war e and (^rectory could even be dtelrtMted to users and 
run entirely on their personal conputers. 30 

The EYPSgpves one or more possRjie numbers. The 
custorttercaritf>enasktheEYPSH)hetpin obtaining one 
ormorepersona/recommerxlalions about the service or 
busmess. 

To obtain the recommendations, the EVPS ftfslcorv 35 
siders people from a fst ol frierxls or colleagues of the 
customer. (Or^e way to obtain tt^ 6st is tsy simply asking 
ttie customer to register frierxis. famBy, or colleagues but 
there are also less intrusive ways of doing this, such as 
by keeping tiack of people wHh whom the customer fre- ^ 
quently communicates.) 

Mow. the Key idea is that the EVPS does not simply 
oonlact every person on the ist txit rather only contacts 
those people that fwe dealt with tf)e particular service 
or business nurnber in the last couple of months. There 4s 
are at least two ways in wt^ch this lend of "serxJer pro- 
active litering' can t>e done: 

1. The EVPS contacts mai titer 109 for each friend 

or colleague, indkatirtg the name and teleplKxie so 
riumber for the service for wl^ a recornn^eridation 
is desired. Mail f9ters 109 that have been trusted 
with ttieir owiw's telephone records andfor records 
ol business transactions can determine whether 
their owner has dealt with that company. If so. they 55 
pass the request on to the Mner. 

2. II the EVPS has cSred access to the telephone 
records of the friends and colleagues (vvhich is the 
case if ^ EVPS is tfrplemented by a program run- 



ning in a long-distarKe r^etwork itseff)* twi it checks 
f)e phorie records itself to detenrnine tf^ Kst kiends 
and ooleagues that have caBed that^pmpany 

TTusw instead botherir^ targe yoip ol people. 
9)ere is a careful screening to erasure tt^at only 9iosa are 
corcacted who have had some recent dealir^gs with the 
particiiar service or business. There are various ways 
of how tie EVPS can complete the process. The least 
irtrusive woy would t>e to sirrply leave a message with 
some ol tie selected people saying "Mr. or Ms. X would 
be irteested in any opinion or recommendation sboUl 
service Y. f^ease contact X at or leave message at 
nuTt>er Z. This request expires at midnight* 

^fole tiat this kind ol "pro-active* mal filtering can 
also be rr^eniented by having tie user send a message 
dvBCty to someone^ maS flier 109. The message 
header woiJd incfode a directive sayir^g 'pass on to user 
it he or she has corrtaded service X at least twice in tie 
last tvee months.* Upon receijpt ol the message, mail 
fiter 109 w9 now filter tbe«message t>ased on tt)e 
indxled directive. Again, note the differerKe witi the 
avreni forms of mail-filtering, wtiere filtering is inder 
complele cortiol of tie reqpient. and tie sender does 
rvt QFve drect instructfons to tie fitering program 

Such a system naturally raises many privacy issues 
tiai can t>e addressed, for example, you may not nec- 
essarty let t>e person seeldng the recommendation 
loicwr who gets tie request-fbr-advk:e messaga That 
way. people would ncA feel obftged to respond. Alsa tie 
identify of the requester coukS be protected by sirrply 
having a message saying 'A tiend would fike an opinion 
or reoonvnendation about servk:e Y* In tiat case tie 
EVPS would orty reveal the kfentity ol tie requester 
orve the reqpient agrees to respond. 

Conduskm 

The brcQoing Detailed DescrpHon has cSsdbsed to 
tx)se skOed in the computer and nefworiong arts how 
norvaddress redpienl inlorrnation in an e-mai inessage 
and a mai filer wtikii indudes a riiodel ol the recipieni 
may t>e used to reduce the amount ol juik e-mail 
recerved by tie reqpient and how tie non-adckess redp- 
iert iriforrnaSon and a mai filter wliich incfodes rnodels 
of tie sender's correspondents may t>e used to reduce 
tie anvxjnt ol e-mal. sent Ixf a user. The Detailed 
CescrzKCn has further disctosed how tie above tech- 
niques may t>eusedtoconstruct an expertise kxator and 
has rSsctosed the best mode presently known to tie 
invertos for inplementing the exp^tise focator. 

K wfl t>e immediately apparent to ttx)se skiBed in the 
oonpuler and networking arts tiat tie prviqpfes of tie 
ir>/ertion rnay be used iri ariy situation wtiere a rnail liter 
has access to information wtikii enabfes it to respond to 
rvn-addressinforma6onaboutttiepo4er<ialrecipierils^ . 
an e-m^ message. It wi8 t>e furtier apparent thai rrariy 
tectviiques rnay f>e used to coristruci rnodels of tie cor- 
responienis andreqpiertfs for use tyy the mail filers The 
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modets may t>e %wf%)HB fists of kfiywords. 9>ey inay be 
inverted fSes. they may be data bases, or ihey may be 
any other arrangement off :da!a which permits t^e nr^ 
f Mar to determine from Ihei model and me norvaddress 
tntormation whether the potential recipient shoOd act^ 5 
aDy receive the message, jit wil furtfier tte app^ert to 
those skilled in the art tiat the location of 9>e mai filter 
in the nehAK)rfc is a matter of design choice FSers wNch 
are located on the same computer system as »ie recp- 
ient have better access to recipient iniormatiorv whie w 
those which are located closer to the sender are more 
efficient at redtjong tie total amount of nelworii taffia 
All of the above being the case, the foregoing 
Detailed Descrption is to be understood as being m 
every respect iBustrative and exemplary, but not restric- is 
tive. and the scope of the invention disdosed heretfi is 
rx>t to be determkied from \he Detaled Des c r^ to x. but 
rather from the clams as interpreted accordrig to the ful 
breadth permitted by the law. 

Ctatms 

1. Apparatus for aiitomaticalyfiriiiting the redpents of 
a message sent via a mai system implemented in a 
computer system, the apparatus comp isi ng. 25 

reppient specifying means in tie message 
wtiich i«es non-address inlbrmation to specify the 
redpients ol ih^tnessage; and message fSering 
'weans in the computer system having access to 
recipient inlbrmation contained thereh dboul at ao 
least one off the potential recipierts and indudbig 
nieans responsive to the norvaddress i it u n m ti tfi 
and to the reqpient information for provkfing the 
messagetothepoten6^reqpientiflherKvvad»ess 
infbrmation and tf»e reqpient infai m afat together 35 
indicate that the potential recipient is to receive the 
message. 

2. The apparatus set form in daim1furtitfcoriY)ns- 
ing: 40 

means for analyzing ties befonging to the 
potentialreciper4tooblahtherecf)ientirtforn^^ 

X The apparatus set forth in dakniwherm: 

tfie inon-address information speciies an 45 
area of expertise; and Ihe reqpient wfonimion >id^ 
cates an area of expertise of the potenti^ rec^)ienL 

4. The apparahis set forth in davn 1 wh^en: 

tt>e message ffltering mear>s is associated so 
with a servf er of ttie noessaga 

5. Ihe apparatus set fortfi in claim 1 wheren: 

the message fitermg means is associated 
with the potential reqpienl of ftemessageL 55 

6- The apparatus set for«i in cttnndaim 2. dam 3. 
cfaim 4. or claim 5 wherein: 

the reqpient infonmafion Mudes flutfier 



rec^ent kto m daiui ^boul o«her potential recp- 
ents:and 

tf)e message fitering means further uses the 
non adckess infarmation and the firtsd|edpien| 

inforn>alion to deterrnir^e wliich of tf>e otar poteritiai 
recipients is to receive tt^ message 

7. Ihe apparatos set forti in daim 1 wherein: 

the message fitering means furtier indudes 
means for sending a referral message to a source 
ot the message wt»itf)e message fitering means 
provides the message to the user. 

8. The apparatus set forti in datm 7 whereh: 

tfie referral message contains an dentifica' 
tionof tfieuser. 

9. TheapparatossetfortfiinciaimTwheren: 

the message is received by a pforafity off 

users; 

the message indudes infionnaticn specffymg 
ttie users wtx> received tfie message: ml 

tlie relerral message fbrttier cor4ains the 

inbrrnatipn speci^^ the users vvfio received the 
message 

101 Theapparatossctfortiind^Gwfteren: 

the message includes information speci^ng 
tfie users «ta received 0ie message; ml 

the message fitrafion means adds informa- 
tion about tfieotier potential reqpiertfs to the kilor- « 
mafon speolying the users who received the 
message 
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